Quantitative method for modeling context in concatenative synthesis using large speech database
نویسندگان
چکیده
Modeling phonetic context is one of the key points to get natural sounding in concatenative speech synthesis. In this paper, a new quantitative method to model context has been proposed. In the proposed method, the context is measured as the distance between leafs of the top-down likelihood-based decision trees that have been grown during the construction of acoustic inventory. Unlike other context modeling methods, this method allows the unit selection algorithm to borrow unit occurrences from other contexts when their context distances are close. This is done by incorporating the measured distance as an element in the unit selection cost function. The motivation behind this method is that it reduces the required speech modification by using better unit occurrences from near context. This method also makes it easy to use long synthesis units, e.g. syllables or words, in the same unit selection framework.
منابع مشابه
Data-driven Segment Pres Trainable Speech Syn
Unit selection based concatenative speech synthesis has proven to be a successful method of producing high quality speech output. However, in order to produce high quality speech, large speech databases are required. For some applications, this is not practical due to the complexity of the database search process and the storage requirements of such databases. In this paper, we propose a data-d...
متن کاملOptimising selection of units from speech databases for concatenative synthesis
Concatenating units of natural speech is one method of speech synthesis. Most such systems use an inventory of xed length units, typically diphones or triphones with one instance of each type. An alternative is to use more varied, non-uniform units extracted from large speech databases containing multiple instances of each. The greater variability in such natural speech segments allows closer m...
متن کاملConcatenative arabic speech synthesis using large speech database
Speech synthesis has got a lot of research interest as it represents an important part in a complete text-to-speech system. In this paper, an Arabic speech synthesis system has been proposed. The proposed system belongs to the family of concatenative speech synthesis systems that use large speech database. The concatenation unit inventory has been automatically constructed from a pre-recorded o...
متن کاملA concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units
This paper proposes a new concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units. Using Japanese broadcast news programs as a speech database, we synthesize Japanese news sentences that are not included in that speech database and perform subjective evaluations of the synthesized speech. As a result, (1) 77% of speech synthesized by ...
متن کاملA System for Data-driven Concatenative Sound Synthesis
In speech synthesis, concatenative data-driven synthesis methods prevail. They use a database of recorded speech and a unit selection algorithm that selects the segments that match best the utterance to be synthesized. Transferring these ideas to musical sound synthesis allows a new method of high quality sound synthesis. Usual synthesis methods are based on a model of the sound signal. It is v...
متن کامل